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EXTRACT A DATABASE RECORD FROM A 
STRUCTURED LITERATURE DATABASE 
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PARSE THE DATABASE RECORD TO EXTRACT ONE 

OR MORE INDIVIDUAL INFORMATION FIELDS 
INCLUDING A SET OF CHEMICAL OR BIOLOGICAL 
MOLECULE NAMES 
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FILTER THE EXTRACTED SET OF CHEMICAL OR 
BIOLOGICAL MOLECULE NAMES TO CREATE A 
FILTERED SET OF CHEMICAL OR BIOLOGICAL 
MOLECULE NAMES 
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FILTERED SET 
STORED 
IN AN INFERENCE 
DATABASE? 



YES 




■NO — i 
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TO B 
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STORE ANY NEW CHEMICAL OR BIOLOGICAL 
MOLECULE NAMES FROM THE FILTERED SET IN 
THE IN THE INFERENCE DATABASE AND SET A 
CO-OCCURRENCE COUNT TO A START VALUE 
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F=0 



INCREMENT CO-OCCURRENCE COUNTS FOR 
PAIRS OF CHEMICAL OR BIOLOGICAL MOLECULE 
NAMES IN THE INFERENCE DATABASE THAT 
CO-OCCUR 
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DONE WITH 
UNIQUE DATABASE 
RECORDS? 



YES 
▼ 
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CONSTRUCT AN OPTIONAL CONNECTION 
NETWORK USING ONE OR MORE DATABASE 
RECORDS FROM THE INFERENCE DATABASE 
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APPLY ONE OR MORE ANALYSIS METHODS TO 
DETERMINE POSSIBLE INFERENCES REGARDING 
CHEMICAL OR BIOLOGICAL MOLECULES 
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GENERATE AUTOMATICALLY ONE OR MORE 
INFERENCES REGARDING CHEMICAL OR 
BIOLOGICAL MOLECULES 



( END J 
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FIG. 3 
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AU - MARTINEZ R 
AU - EDWARDS CA 
Tl - EXPRESSION, PURIFICATION 
AND FUNCTIONAL 
CHARACTERIZATION OF THE 
DNA-BINDING DOMAIN OF THE 
HERPES SIMPLEX VIRUS TYPE 1 
UL9 PROTEIN. 
RN - EC 3.4.21.5 (THROMBIN) 
RN - 0 (VIRAL PROTEINS) 
RN - 115004-77-8- (HERPES SIMPLEX 

VIRUS TYPE 1 PROTEIN UL9) 
RN - 9007-49-2 (DNA) 
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THROMBIN 
VIRAL PROTEINS 
HERPES SIMPLEX VIRUS TYPE 1 
PROTEIN UL9 

DNA 
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THROMBIN 

HERPES SIMPLEX VIRUS TYPE 1 
PROTEIN UL9 

DNA 
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FILTER 
WORDS: 
VIRAL 
PROTEINS 

V~ 

74 



INFERENCE DATABASE 



CO-OCCURRENCE COUNTS 



ID NAME 

1 THROMBIN 

2 HERPES SIMPLEX... 

3 DNA 



ID1 


ID2 


COUNT 


1 


2 


12 


1 


3 


4 


2 


3 
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INFERENCE: HERPES SIMPLEX VIRUS TYPE 1 
PROTEIN UL9 INTERACTS WITH DNA 
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FIG. 4 

(start) 
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CREATING A CONNECTION NETWORK FROM AN 
INFERENCE DATABASE, WHERE THE CONNECTION 

NETWORK INCLUDES TWO OR MORE NODES 
CONNECTED BY ONE OR MORE ARCS, WHERE THE 
ONE OR MORE ARCS REPRESENTS CO- 
OCCURRENCES BETWEEN CHEMICAL OR 
BIOLOGICAL MOLECULES, WHERE THE 
INFERENCE DATABASE INCLUDES DATABASE 
RECORDS WITH ONE OR MORE INFERENCE 
ASSOCIATIONS 
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APPLY ONE OR MORE ANALYSIS METHODS TO 
THE CONNECTION NETWORK TO DETERMINE ANY 
TRIVIAL INFERENCE ASSOCIATIONS 
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DELETE DATABASE RECORDS FROM THE 
INFERENCE DATABASE DETERMINED TO INCLUDE 
TRIVIAL INFERENCE ASSOCIATIONS, THEREBY 
IMPROVING INFERENCE KNOWLEDGE IN THE 
INFERENCE DATABASE 



Q END ^ 
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